Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval
ثبت نشده
چکیده
Here we offer a more detailed description of the proposed model to facilitate re-implementation. A schematic illustration of the network architecture of the proposed model can be found in Fig. 2 of the main paper. It shows that the model is a Siamese triplet network with three branches of identical architectures and shared parameters. In this section, we further describe the detailed network architecture for each branch. Table 1 shows that each network branch has 7 convolutional layers and 2 fully connected layers. The first 5 convolutional layers are the same as the Sketch-a-Net [5] and the last 2 convolutional layers are part of the proposed attention module. Note that the output of the attention module is a 7× 7 attention mask which is used to re-weight the feature map of pool5. The attention module shortcut connection (see Sec. 3.2 of the main paper) takes place before Layer No. 12 and the coarse-fine fusion shortcut connection occurs before Layer No. 14 (as detailed in Sec. 3.3 of the main paper). The final output of each branch is a 512D feature vector which is then subjected to our HOLEF loss.
منابع مشابه
Deep Multi-task Attribute-driven Ranking for Fine-grained Sketch-based Image Retrieval
With touch-screen devices becoming ever more ubiquitous, sketch holds great promise as an intuitive and efficient mode of input compared to classic alternatives. This has motivated a major revival of interest in vision-based analysis of sketches, notably in sketch-based image retrieval (SBIR). Superior to classic SBIR methods, finegrained SBIR (FG-SBIR) methods [1] are proposed to make fine-gra...
متن کاملInstance-Level Coupled Subspace Learning for Fine-Grained Sketch-Based Image Retrieval
Fine-grained sketch-based image retrieval (FG-SBIR) is a newly emerged topic in computer vision. The problem is challenging because in addition to bridging the sketch-photo domain gap, it also asks for instance-level discrimination within object categories. Most prior approaches focused on feature engineering and fine-grained ranking, yet neglected an important and central problem: how to estab...
متن کاملFine-grained sketch-based image retrieval by matching deformable part models
An important characteristic of sketches, compared with text, rests with their ability to intrinsically capture object appearance and structure. Nonetheless, akin to traditional text-based image retrieval, conventional sketch-based image retrieval (SBIR) principally focuses on retrieving images of the same category, neglecting the fine-grained characteristics of sketches. In this paper, we advoc...
متن کاملFine-Grained Image Retrieval: the Text/Sketch Input Dilemma
Fine-grained image retrieval (FGIR) enables a user to search for a photo of an object instance based on a mental picture. Depending on how the object is described by the user, two general approaches exist: sketch-based FGIR or text-based FGIR, each of which has its own pros and cons. However, no attempt has been made to systematically investigate how informative each of these two input modaliti...
متن کاملQuery-adaptive Image Retrieval by Deep Weighted Hashing
The hashing methods have attracted much attention for large scale image retrieval. Some deep hashing methods have achieved promising results by taking advantage of the better representation power of deep networks recently. However, existing deep hashing methods treat all hash bits equally. On one hand, a large number of images share the same distance to a query image because of the discrete Ham...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017